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BENJAMIN HELLOUIN DE MENIBUS AND MATHIEU SABLIK 



Abstract. The asymptotic behavior of a cellular automaton iterated on a random configuration 
is well described by its limit probability measure(s). In this paper, we characterize measures and 
sets of measures that can be reached as limit points after iterating a cellular automaton on a simple 
initial measure, in the same spirit as SRB measures. In addition to classical topological constraints, 
we exhibit necessary computational obstructions. With an additional hypothesis of connectivity, 
we show these computability conditions are sufficient by constructing a cellular automaton realising 
these sets, using auxiliary states in order to perform computations. Adapting this construction, 
we obtain a similar characterization for the Cesaro mean convergence, a Rice theorem on the sets 
of limit points, and we are able to perform computation on the set of measures, i.e. the cellular 
automaton converges towards a set of limit points that depends on the initial measure. Last, under 
non-surjective hypotheses, it is possible to remove auxiliary states from the construction. 



Introduction 

A cellular automaton is a complex system defined by a local rule which acts synchronously and 
uniformly on the configuration space A z , where A is a finite alphabet. These simple models have a 
wide variety of different dynamical behaviors. We are interested in the typical asymptotic behavior 
starting from a random configuration, as this is usually done in simulations; different approaches 
stemmed from such observations. It is well-described by taking the iterated image of the initial 
measure under the action of the cellular automaton, and considering the limit points of this sequence 
in the weak* topology. 

It is natural to ask which sets of measures can be obtained as limit points in this way. Obviously, 
any measure can be reached by iterating the identity on itself. Therefore, a more interesting ap- 
proach is to start from some simple measure such as the uniform Bernoulli measure. In some sense, 
this corresponds to SRB measures which are "physically" relevant invariant measures obtained when 
starting from the Lebesgue measure in continuous dynamical systems [You02j. 

Formally speaking, given a simple initial measure fj,, we want to characterize all reachable V(F, //), 
the sets of accumulation points of (i^/i^g^, the sequence of the images of fj, under the iterated action 



of F, and V'(F,fj,), the sets of accumulation points of Yli=o^*^j i the Cesaro mean of the 

previous sequence, for all possible cellular automata F. 

Previous works focused on the //-limit set, which corresponds to the union of the support of the 
limit measures |KM00llKu05] . Very complex /x -limit sets can be constructed [BPT06, BD SIO] , and 
our construction is partly inspired from these works. 

Describing limit measures has been done for only few concrete nontrivial examples. There are 
essentially two types of convergence quite well understood: 
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• convergence towards a simple measure: for example, the cyclic cellular automaton on three 
states introduced in [Fis90j, starting from a Bernoulli measure, converges towards a linear 
combination of Dirac on uniform configurations [dMSllJ; 

• randomisation phenomenon for linear cellular automata: the Cesaro mean sequence of the it- 
eration of a linear cellular automaton on a initial measure converges to the uniform Bernoulli 
measure as soon as the initial measure is in a large class which contains Markov mea- 
sures [FMMNOOl IMM981 IPY02] . 

For any cellular automaton, starting from a Bernoulli measure or a Markov measure, we obtain 
after a finite number of steps a hidden Markov chain which is well understood [BPllJ. If we consider 
a computable initial measure jx (which means that there is an algorithm that approximates at a 
known rate the probability that a word u £ A* appears), then it is easy to see that F*fi is also 
computable. For example, a Bernoulli or Markov measure is computable iff its parameters are 
computable real numbers. 

The limit measure(s) are not necessarily computable since the speed of convergence is not known. 
Nevertheless, we show in Section[2]that there exists necessary computational obstructions. The main 
problem is to prove the reciprocal, in other words: given a set of measures satisfying the computa- 
tional obstructions, construct a cellular automaton which, starting on any simple initial measure, 
reaches exactly this set asymptotically. Similar computational obstructions appear when character- 
izing possible topological dynamics properties of subshifts of finite type or cellular automata: possi- 
ble entropies [HM10J, possible growth-type invariants |Meyll| , possible sub-actions [Hoc09, ASTT]... 
However, the construction is quite different here since starting from a random configuration requires 
to self-organize the space, in the same spirit as the probabilistic cellular automaton of [GacOlJ which 
corrects the random perturbations. 

In Section [3l we construct a cellular automaton F such that, starting from any shift-mixing 
probability measure \i with full support, the limit points of the sequence of measures {F\\x)t& are 
described as the accumulation points of a computable polygonal path of measures supported by 
periodic orbits. First of all the cellular automaton divides the initial configuration in segments and 
formats each segment using a method similar to the one developed in [DPSTll] . Computation 
takes place in a negligible part of each segment and the result is copied periodically on the rest 
of the segment. In order to have an arbitrarily large area of computation, segments are merged 
progressively in a controlled manner. The difficulty of the construction is to synchronize all the 
operations to ensure the convergence. 

In Section H] we use the construction of Section [3] to solve some related problems, along with some 
open questions. The results are, for a fixed measure \x in a large class of measures: 

• characterization of shift-invariant measures v such that there exists a cellular automaton F 
which verifies F^fi — > v (Corollary [T]); 

t— >oo 

• characterization of connected subsets of shift-invariant measures V such that there exists a 
cellular automaton F which verifies V(F, [/,) = V (Corollary [2]); 

• characterization of subsets of shift-invariant measures V' such that there exists a cellular 
automaton F which verifies V'(F,fi) = V' (Corollary [3]); 

• characterization of connected subsets of shift-invariant measures V' C V such that there 
exists a cellular automaton F which verifies V(F,[i) = V and V'(F,[i) = V' (Corollary |4|). 

• Rice theorem for shift-invariant measures and connected subsets of shift-invariant measures 
reached by a cellular automaton (Corollaries [U [6] and [7]). 

In Section 14.41 we consider the case where the set of limit points depends on the initial measure. 
Computational constraints appear to describe functions [i i — > V(F, fj.) that can be realized in this 
way. Indeed, it is possible to "transfer" the computational complexity of the initial measure (using 
it as an oracle) to the set of limit points. Modifying the construction of Section [3l we manage to 

2 



build a set of limit points depending on the density of a special state; however, we do not obtain a 
complete characterization. 

In the Section [5J we carry the previous characterizations to the case where auxiliary states are 
not allowed, i.e., the cellular automaton can only use the same alphabet as the limit measure(s). 
This is only possible, however, under some additional hypotheses on the support of the measures. 

I. Definitions 

1.1. Configuration space and cellular automata 

Let A be a finite alphabet. Consider A z the space of configurations which are Z-indexed se- 
quences in A. If A is endowed with the discrete topology, A z is compact, perfect and totally 
disconnected in the product topology. Moreover one can define a metric on A z compatible with 
this topology: 

Vx,y G A z , d c (x,y) = 2- Bn MU !a, **« ieZ \ 

Let DcZ. For x G A z , denote xu 6 A^ the restriction of x to U. Given a pattern w G A^, one 
defines the cylinder [w]u = {x G A z : xu = w}. Denote A* = [j n A n the set of all finite words 
w = wq . . . tu n _i; \w\ = n is the length of to. Also denote [w]i = [w][i t i + \ w \_i] and [w] = [w]o = 

H[o,| m |-i]- 

The shift map cr : A z — > A z is defined by er(x)j = Xj+i for x = (x m ) m g2; G A z and i G Z. 
It is an homeomorphism of A z . Let w £ A*, denote x = 00 w°° the er-periodic word defined by 
x [o,H-i] = w an d & l+ ^(x) = a' L {x) for all i G Z. 

A cellular automaton (CA) is a pair (yl z , F) where F : A 1, — > A z is a continuous function that 
commutes with the shift (a 1 o F = F o<j l for all i G Z). By Hedlund's theorem, it is equivalent to a 
function defined by F(x)i = -F((xj+ u ) ue u F ) f° r & U ^ £ A z and i G Z, where Up C Z is a finite set 
named neighborhood and F : A vp — > A is a local rule. 

1.2. Sets of measures on A z 
1.2.1. Dynamical properties 

Let 53 be the Borel sigma-algebra of A z . Denote by A4(A Z ) the set of probability measures on A z 
defined on the sigma-algebra 23. Let M. a (A z ) be the cr-invariant probability measures on A z : 
that is to say the measures \i G Ai(A z ) such that /j,(a~ 1 (B)) = fi(B) for all B G 23. Trivially, for a 
cr-invariant measure one has //(k\t]j) = /u([it]) for all i G Z. 

Usually .Mct^ 2 ) is endowed with weak* topology: a sequence (/U n ) ng psj of M a (A z ) converges 
to n G M a (A z ) if and only if, for all finite subsets U C Z and for all patterns u G A^, one has 
linin^oo /x n (Mu) = p(Mu)- In the weak* topology, the set A4 a (A z ) is compact and metrizable. A 
metric is defined by 

d M (n,v) = V — max |/i([u]) - 
z — ' 2 n ueA™ 

neN 

Define the ball centered on fi G A4 CT (^4 Z ) of radius e > as 

B(fx,s) = {1/ G M a (A z ) : d M {ii,v) < e} . 

A measure [i G A4 a (A Z ) is u-ergodic if for every cr-invariant borelian subset B G 23 (that is to 
say a~ l {B) = B /x-almost everywhere), one has fJ,(B) = or 1. The set of cr-ergodic probability 
measures is denoted by M a - erg (A z ). Of course M a - erg (A z ) C M a (A z ). 
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For U C Z not necessarily finite, denote by Q3u the cr-algebra generated by the set {[w]u : u G 
„4 U ',U' C U}. Define the weak mixing coefficients of a measure fx G .M CT (.4 ) as 



finite 



^(n) = SUp | ~ 1 : A e ®]-oo,0], B G ®[»,oo[i M(^) > 0, MB) > oi • 

A measure \x G ^^(^l 2 ) is ■(/'-mixing if Vv( n ) — > 0. Denote A / f -_ m i x (w4. z ) the set of ^-mixing 

measures, of course M. a -mix{A z ) C 7W,j-erg(^4 Z )- 

For a measure /i G ^(.A 2 ), define supp(^), the support of fi, as the set of configurations of A z 
such that any open neighborhood of these points have positive measure. Denote M^ eTg {A z ) the 
set of ergodic measures with full support, and -M^z^^A ) the set of ^-mixing measures with full 
support. 



1.2.2. Classical examples 

Let A = (A a ) G [0; l]" 4 such that YlaeA ^ a = ^' ^he associated Bernoulli measure fi\ is defined 
by 

/j,\([uq . . . u n \) = X Uo ■ ■ ■ X Un for all u = u ■ ■ ■ u n G A*. 
Let x G A z . The Dirac measure supported by x is defined by 

if x [u] 

1 if x G [u]. 



Generally 5 X is not cr-invariant. However, if x is cr-periodic, it is possible to define the cr-invariant 
measure supported by x taking the mean of the measures 8 c i< x y Thus, for a word w G A* , we define 
the cr-invariant measure supported by 00 w 00 by 

W = 1 7 / . 5 iloa w co\. 

\w\ K ' 

1 1 ie[o,M-i] 

The set of measures <^6 W : w G ^4*| is dense in A4 a (A z ) |Pet83j . 
1.2.3. Action of a cellular automaton on M. a (A 1 ') and limit points 

Let (A Z ,F) be a cellular automaton and fi G M a {A 1 '). Define the image measure F*/a by 
F^/j,(A) = /i(i ?_1 (^4)) for all A G 53. Since F is cr-invariant, that is to say F o a = a o F, one 
deduces that F*fi G M CT (A Z ). We consider the continuous application F* : M a (A z ) — > M a (A z ). 
Moreover if fi G M a - erg {A z ) then G M a -er S (A Z ). 

We consider (F*[i) the iteration of F on ju, and its Cesaro mean at time t G N defined by 



2=0 

For a measure ^ G A4 a (A z ), we are interested in the asymptotic behavior of the sequences 
(F^fi)teN and (ifffj,) te ^. Define V(F,fi) the set of cluster points (or limit points) of the se- 
quence (F*//)t £ N and V'(F,fj,) the set of cluster points of the sequence (ipffj,) te ^. Since 
M a (A z ) is compact, V(F, fi) and V'(F,fj,) are nonempty. When V(F,fi) is a singleton {z/}, then 
— ^ KM)- This corresponds to the intuitive notion of asymptotic behavior as observed 

t— >oo 

in simulations. 



Our main purpose is to characterize which sets of measures can be realised in this way. There 
are topological obstructions for these sets: V(F,fj.) and V'(F,fj,) are closed and thus compact, and 
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it is well known that V'(F,fj) is connected. There are also computability obstructions: we study in 
the next section the case where the initial measure is computable, and this will be generalized in 
Section H20 



2. Computability obstructions 

2.1. Notion of computability 

Definition 1. A Turing machine TAi = (Q,T,#,qo,5 : QF) is defined by: 

• r a finite alphabet, with a blank symbol # ^ T. Initially, a one-sided infinite memory tape 
is filled with except for a finite prefix (the input), and a computing head is located on 
the first letter of the tape; 

• Q the finite set of states of the head; go G Q is the initial state; 

• 5:QxT^QxTx {<— , • , — >} the transition function. Given the state of the head and 
the letter it reads on the tape — depending on its position — the head can change state, 
replace the letter and move by one cell at most. 

• Qf C Q the set of final states — when a final state is reached, the computation stops and 
the output is the value currently written on the tape. 

A function / : X — > Y with X and Y two enumerable sets is computable if there exists a Turing 
machine that, up to reasonable encoding, stops and returns f(x) on any entry x £ X. 



2.2. Measures and computability 

Definition 2. A measure ju G M. a (A z ) is computable iff there exists / : A* x N — > Q computable 
such that 

\fi([u])-f(u,n)\<2- n . 

A sequence of measures (//j)j g N is computable iff there exists / : x N x N - >• Q computable 
such that — f(u,n,i)\ < 2~ n . In particular all fii are computable. 

A measure [i G J\A a (A^) is recursively computable iff there exists an computable sequence of 
measures (^/^)^p^ such that lim^— Hi = /i. Equivalently there exists / : A* x N — > Q computable 
such that 

KN)-/(u,n)| -> 0. 

n— ¥oo 

Denote A4'^ >mp (A 1 ') the set of computable measures and Ai T a COVCiV \A Z ) the set of recursively 
computable measures. Of course A4% >mp (A 1 ') C A4a Comp (A z ). There exists an equivalent way to 
define these notions: 

Proposition 1. A measure fi G M. a {A I ") is computable if and only if there exists f : N — >• A* 
computable such that dju (fj,,Sf^J < 2~ n for all n G N. 

A measure [i G A4o-(A z ) is recursively computable if and only if there exists f : N — > A* com- 
putable such that lim 8tr n ) = fi. 

Proof. Let /i G Ai™ mp (A Z ). Since the set ^5 W : w G ^4*| is dense in A4 a (A^) and is enumerable, 

we enumerate it until we find w G A* such that — <5u,([it])| < 2~ n ~ 2 for all u G A k with 

k G [0, n + 1]. This is possible since /j, is computable. One has 

d^(/x,<U = ^^maxKM)-KM)l < Ji ^ 2^" 

i£N " i>n+2 



Let fj, G Ada (A ). There exists a computable sequence of measures such that 

lmij^oo Hi = jjL. By the previous point we get a word f(n) G *4* such that d^iUm 8 f(n)) — 2~ n for 
n G N. Clearly / : N — > A* is computable and eZ/n^, <5f( n \) < dj^i^, Mn) + dM{f J "n,o~f(n)) — ^ 0- 

In both cases, the reciprocal is obvious. □ 



The set of measures supported by periodic orbits is dense in Ai a (A z ). We will need a more 
precise statement that the rate of convergence can be bounded by a computable sequence. 

Proposition 2. There is a computable sequence at — > such that for all fc£N, one has 

fe^oo 

M(A Z )C (J B(Su,a k ). 
ueA k 

In the following proof, all logarithms are in base |„4|. 

Proof. Take \x G M(A Z ) and n G N. For each u; G .A™, define G N such that \p w - fJ,([w]) ■ \A\ 3n \ < 
1 and J2 w eA"P w = l-4| 3n - Now consider the following generalisation of the de Bruijn digraph with 
multiedges and loops, noted G = (V,E). The set of vertices V is A n_1 , and for all u, v G V: 

• if there are a, b G A and w G A n ~ 2 such that u = aw and v = wb, then there are p aw b edges 
from u to v labeled b; 

• otherwise, there are no edge from u to v. 

Intuitively, from an Eulerian cycle in this graph we could build a periodic word where the fre- 
quency of each word u G A n is rfpr; however, this graph is not necessarily Eulerian. We can turn 
it into an Eulerian graph by adding a small number of edges. 

We denote d + (u), resp. d~(u) the outdegree, resp. indegree of the node u. By definition, 
d + {u) = Y^aeAPau so that \^{[u])-\A? n -d + {u)\ < *£ a eA MM) • l-4| 3n ~Pau\ < \A\, and similarly 
for d~(u). We note I(u) = d + {u) — d~(u) the imbalance of the node u. Notice that \I(u)\ < 2\A\ 

and Y.uev 1 ^) = °- 

Suppose there exists u, v G V such that I(u) < 0, 1{v) > 0. There is a sequence of vertices 
no, . . . ,Ui with i < n — 1, uo = u and u; L = v, such that we can add a labeled edge between two 
successive vertices without breaking the edge condition defined above. By doing so we increse I(u) 
by one and decrease I(v) by one without affecting other vertices. By repeating this operation, we 
bring the imbalance of every vertex to with less than 2(n — 1)\A\ • |-4| n_1 additional edges, that 
is, we get an Eulerian graph. Denote G' this augmented graph and p' w the multiplicity of the edge 
corresponding to w G A n . 

From an Eulerian cycle in G' we build a word tt such that | vr | = ^2 we ^ n p' w < |„4| 3ra +|.4.| n+log ( 2ri ) < 
|„4| 3n+1 , and we consider the measure supported by its periodic orbit. For w G A n , we have by 



construction <5„. ([«;]) 



)-ia6i" P'u 



-, so that: 



MM) - MM) 
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3n 



\A\ 
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|^|log(2n)-2n 



1 



P'w 



Pw + \p'w~Pw\ + 1 
+ |^|™+ lo s(2«) + 1 



1 



|^4|2n-log(2n) 
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and for any k < n, if we consider w 6 A k , \5 n ([w]) — /i([w])\ < ^| n+fc 1 : log(2n) , Consequently: 

^, / ,)=^maxM^M 



— 2-^/ 2 k M|n+fc-log(2n) 2™ 
1 



< 



|_4|n-log(2ra) 

and |7r| < |„4| 3n+1 . Since such a it exists for all fj,, we can choose = |.4|~L — S J+ lo s( 2 L 3 J) ; 
which is computable. □ 

2.3. Action of a cellular automaton on computable measures 

Proposition 3. Let (A Z ,F) be a cellular automaton. If fj, € M.a° mp (A 1 ') then (F*^)^ is a 
computable sequence of measures. In particular, if F\\x — > v then v € M r a comp (A 1 *) . 

t—>-oo 

Proof. Suppose \A\ = 2 to simplify the proof. By definition, there is a computable function / : 
A* x N — > Q such that |/x([tt]) — f(u,n)\ < 2~ n . The function F is defined locally; in particular, 
if Uf C [l,r], F t (x)^ ^ depends only on Xm^^y In other words, for all u £ A k , there is a set 
Pred 4 (u) C A^ rt+k ^ such that F~*([«]) = U 

uePred t («)H' Now consider the function 
f':(u,n,t)^ Yl f(v,2n+(r-l)t). 

t)GPredt(«) 

It is computable by enumerating elements of J[ k +( r - l ) t an d checking if F t ([v]-u) C [u] by iterating 
the local rule on v. Finally 



|F*/x(M) - f'(u,n,t)\ 



/'I U Ml- E /(«,2n+(r-0t) 

uGPred t («) / t>£Pred t (?i) 

1 E HM)-/(",2n + (r-Z)t)| 

^GPredi(u) 
^ 2 n +( r ~0* . 2 _ ^n— (r— Z)t 2 _ n 

which means that (i ? */u)t e N is a computable sequence of measures. □ 
We wish to extend this computability obstruction to sets of limit points. 

2.4. Compact sets in computable analysis 

We introduce computability notions on compact sets. This is a part of the theory of computable 
analysis on metric spaces for which a standard reference book is [WeiOOj. 

In a metric space, a compact set V is characterized by the function dy : [A i— > inf^gy oI_m{v, jj). 
Thus the computability of a compact set can be defined by the computability of its distance function. 

Definition 3. A sequence of functions (f n : M. a {A 1 ') — > K) n eN is computable if 

• there exists a : N x N x A* — > Q computable such that f n (6 w ) — a(n, m,w) < ~ for every 
w £ A* and n, m E N (sequential computablity); 



• there exists b : N — > Q computable such that <i_A/((u, v) < b(m) implies \ f n (n) — fn( v )\ < ~ 
for all n, m G N (computable uniform equicontinuity) . 
A function / : A4 a (A z ) — > R is E2-computable if there exists a computable sequence of 
functions (/ n : M. a (A 7 ') — > R) n eN such that / = liminf f n . 

n — >oo 

A compact set V C M a (A z ) is ^-computable if dy : M a (A z ) — > R is Incomputable. 

If such a set is reduced to a singleton {v} then v G Ai r a corav '(A z ) and reciprocally. 

Remark 1. Classically a function / : A4 a (A z ) — > R is computable if it satisfies Definition [3] 
without dependency on the index of the sequence, and a compact set V C A4 a (A 1 ') is computable 
if dy is computable. In this case it is decidable whether a finite set of balls is a cover of V. 

The name "^-computable" comes from an analogy with the real arithmetic hierarchy |ZW0H 
IZie05] 

Proposition 4. Let (A Z ,F) be a cellular automaton and \i G M^ omp (A z ). Then V(F, /i) and 
V'(F,fi) are E2- computable compact sets. 

Proof. Let /„ : v 1— > dyv/t (F™fM, v) . Since \i is computable, (/„) n gN is sequentially computable. 
Moreover \f n {y) - f n (i/)\ = \d M {F*n,v) - d M {F™ ^,v')\ < d M (u,u') so (/„) n eN is computably 
uniformly continuous. The result follows from the fact that dyrp^iy) = liminf n _ s>00 d_\4(F™[i, v). 
The same reasoning holds for V'(-F, u). □ 

2.5. Technical characterization of E 2 -computable compact connected sets 

To realise a ^-computable compact set of measures V by a cellular automaton, we need a recursive 
enumeration of words (w n ) n( zfq which describes V in a certain way. However, the limit points of the 
construction presented in Section [3] are a connected set, because it builds an infinite path composed 



of segments of the form 



0~u,0~v 



[t5 u + (1 - t)5 v : t G [0, 1]} C M a {A z ) where u,v G .4*. The 
following proposition describes how such connected sets can be covered by a polygonal path. 

Definition 4. Let (u> n ) ng N be a sequence of words of A* . Denote V((w n ) ne pj) the limit points of 
the polygonal path defined by the sequence of measures (d Wn ) n ^fq 



iV>0n>iV 

Proposition 5. Let V C M (7 (B 1 ') be a non-empty ^-computable, compact, connected set (T,2-CCC 
for short). Then there exists a computable sequence of words (w n ) n ^fq such that V = V((w n ) ne fq). 

Proof. By definition, there is a computable sequence of functions (/ n )neN satisfying <iy = lim inf ne pj f n , 
and let a : N x N x A* — > Q and b : N — > Q be the computable functions given by Definition |3] 
Define A^ k = Ui< fe ^- Clearly A^ k C A^ k+1 and UfceN^ : w G is dense in M a (A z ). 

Now define V| = {w G A- k : 3t' G [k,t] such that a(t',t,w) < 2af.\, where the computable se- 
quence (afc)fceN is defined in Proposition [2j For each t, we compute the elements of V| for k < t 
by increasing k. The sequence w n is built algorithmically in the following way: 

Suppose wo . . . w n have already been computed and we begin to compute \\. for k < t. When 
an element w of that was not in Vj, -1 is computed, we want to add it to the sequence. Let 
i G [0, k] be the maximal value such that U ugV t B((5 U , «j) has a connected component containing 

5 Wn and 5 W . We can find with a simple graph algorithm a sequence uq, . . . ,Uj G V* such that the 
path [8 Wn , S Uo ] U • • • U [5 Uj , 5 W ] is included in it. The words no, . . . ,Uj,w are added to the sequence 



Now we will prove the correctness of this algorithm. 
Claim 1: If // € V, then fi G V{(w n ) ne n) 

Proof. There is a sequence of words (« n ) ng N such that <ijn (<5 Un , /x) < a n . Since dv(8 Un ) < a n , 
there is a i > such that ft{8 Un ) < Let t n > t satisfying j- < Then a(t,t n ,u n ) < 

ft{°~u„) + < 2ajt, which means that each u n is in V*™ |, and by construction it appears at 
some point in the sequence (w n )neN- ^ Claim 1 

Claim 2: Vife, 3T fc , Vt > T fc , V* = V^ fc . Furthermore, we have V C U ^r k B(8 U , a k ). 

Proof. For all k, Vt is a discrete increasing bounded sequence (in the inclusion sense), so it is 
stationary. For each element u, G V there is an element u k G „4- fc such that cLvj(/i, <5 Ufc ) < o^, 
and therefore dv(<5 Ufc ) < Consequently, G V| for t large enough. O claim 2 

CLAIM 3: Let u, v G „4- fe . If dv(<5 u ) < Ofc> ^v(^) < a fc> then one can find a path u 

110,11!,..., Uj-^Uj =v G Vl k such that Uie[o,j-i] <Wi] c {/^ : d v(» < <*fc} ■ 

Proof. One has u,v £ V^ fe . Let z/, 1/ G V be such that d{8 u ,v) < a k and d(8 v ,v') < a*,. 

By Claim 2, U ueV T fc B(<5 u ,afc) contains V, which is connected, so there is a path from v 

to v 1 in 1J r fc B(u, afc). Thus there exists uo,...,Uj G VjT* such that uo = u, Uj = v, 
dM(5ui,5u i+1 ) < a k and d v (8 Ui ) < a k for i G [0,j - 1]. Finally, 

|J feAi+i] C : dv(A») < "fc} • 
ie[0j-i] 

O Claim 3 

Claim 4: Ve > 0,3n e ,Vn > n e , we have V/i G g?v(aO > 3e =>■ > e. 

Proof. Let /i G M. a (A L ) such that > 3e. 3n £ (/i),Vn > n £ (fi), f n (n) > 2s. By taking 

fc e G N such that ^ < e, we have by computable uniform equicontinuity of (/ n )neN /n( iy ) > £ 
for all f G B(/j, 6(fc e )) and all n > n e {a). 

Since {^i G M a (A Ij ) : dy(/j,) > 3e} is compact, we can cover it with a finite number of 
balls of radius b(k e ), and we define n £ the maximum value of n e (/i) on ball centers. Thus, 
Vn > n e , V> G M a (A z ), dy(fi) > 3e / n (/x) > e. O ciaim 4 

Claim 5: Jffi^ V, then u, V((^„)„ g n)- 
Proof. Let /i g" V and consider e > such that 2e < dy(n). Take fc' large enough such that 
a k ' < s, and k large enough such that 9a k < a k '- Denote n^ ak the integer obtained in Claim 
4. 

Take t > m&x(T k , -^-,n^ ak ), and let w be a word added by the algorithm in the sequence 
(u> n )neN after time t. Since t > T k , there exists t' G such that a(t',t,w) < 2a k - Thus 

one has ft'{$ w ) < 2afe+| < 3afc. Since t > n^ ak , by Claim 4 we deduce that dy(8 w ) < 9a k < s. 
Since t > T k > T k >, by Claim 3, all paths added in order to reach w are in {u, : dy(/j,) < a k }. 
Thus, for N G N large enough: 



B(ji,e)n [J [5 Wn ,8 Wn+1 

n>N 

which means that fj, V((w n ) ne n). O Claim 5 



3. Construction of a cellular automaton realising a given set 

of measures 



We want to prove a reciprocal to Proposition [3] and a partial reciprocal to Proposition U using 
Proposition [5) Given a computable sequence of words (io n )neN in B* , we construct a cellular 
automaton realising V((u? n )neN) as its set of limit points. 

Theorem 1. Let {w n ) n ^ be a computable sequence of words of B* , where B is a finite alphabet. 
Then there is a finite alphabet AD B and a cellular automaton F : A z — > A z such that: 

• for any measure /i G M f ^ mix (A I ') ! V(F,n) = V((u>„)„ 6 n)- 

• if V((u> n ) n6 N) = {v}, then for any measure \i G M^_ ero {A 1 "), F*fi — > v. 

Furthermore we get an explicit bound for the convergence rate in the first point of the theorem: 

dM(F^,V((w n ) neN ) < O ( — -r J +SUp < d M (v, V{{w n )nm)) ■ ^ S (J [<W,<WJ>, 

g [ n>n(t) J 

where n(t) = 0(log(t) 2 ) and assuming that w n is computable in space \Jn. 

In the rest of the section, we detail the construction of this cellular automaton and prove this 
theorem. 



3.1. Sketch of the construction 

In this section, we present the general framework to build the alphabet A and the cellular automa- 
ton F. Our goal is to compute each w n successively and write concatenated copies of it on the 
whole configuration. A contains a symbol [w] (for wall) persisting in time, except under special 
circumstances; the idea is that to the left of each wall there is a small computing area where w n is 
computed, and then it is copied repeatedly to its left until the next wall. 

A main issue is to initialize the computing synchronously for each wall, even though we have no 
control over what cells appear at time 0. To do this, we define another symbol [TJ (init), which 
appears only in the initial configuration, creating a wall while erasing the contents of neighboring 
cells and initializing computation on its left. The resulting wall is said to be initialized. 

Definition 5. Let x G A 7 ". is a segment at time if X{ and xj are two consecutive |Tj 

symbols, and a segment at time t if F t {x)i and F l (x)j are two consecutive initialized walls |w| ■ 
Define the length of this segment as i — j — 1. 

In all the paper, t is the current time and (T n ) G N N an increasing sequence of times that will be 
fixed later. Intuitively, T n is the time when each machine has computed w n and starts copying it 
in its left segment. 

Since we have no control over the initial contents of each segment, we want to destroy non- 
initialized walls to guarantee that synchroneous computing takes place everywhere. 

To do that, we use the following method which is represented in Figure [TJ On its left, each 
initialized wall keeps the value of current time under the form of a binary counter incrementing at 
each step (time counter), and sends another incrementing counter to its right progressing at speed 
one (sweeping counter). Time and sweeping counters already present in the initial configuration 
(not initialized) have a positive value, whereas those created by an [TJ symbol (initialized) have 
value at time 1, and they increment at the same rate. Thus, non initialized walls have older time 
counters, and by comparing time counters and sweeping counters as they meet, we can erase older 
counters and non-initialized walls as well. 
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see section 13.2.11 



FIGURE 1. Sketch of the bootstrapping and sweeping processes. Vertical lines are 
walls. White areas and grey areas are swept and non-swept, respectively. Dashed 
parts contain time counters (section I3.2.3j) and Turing machines (section 13.3. 1[) . 
Slanted lines are sweeping counters (section l3.2.4p . 

In order to enlarge computational space and decrease the density of auxiliary states, segments 
of length n are merged with their left neighbour at time T n . To determine the length of its right 
segment, each wall sends a signal to the right that bounces off the next wall and counts the return 
time. The Figure [2] is an overview of copy and merging processes. 



time 

Newer merging signals 
erase older signals 




n 



FIGURE 2. Sketch of the copying and merging processes. We suppose all walls are 
initialized. Slanted thick lines are copy processes (see section |3.3.2|) . slanted dotted 
lines are merging signals (see section \3A\i . 
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Thus the considered alphabet consists in different layers A = {[TJ, |W| } U -4 utput x A 



comp 



X 



Atime X -^sweeping ^ *^copy ^ Emerge? where. 

• [TJ and |w| are the two above-mentioned symbols; 

• -^output = B U {#} is the layer on which w n is output and then recopied; 

• -4comp is the layer where computing takes place by simulating Turing machines; 

• At i me is the layer on which time counters are incremented; 

• -^.sweeping is the layer on which sweeping counters move and are incremented, and where 
comparisons are done; 

• A CO py is an auxiliary layer used in the process of writing copy the output; 

• Anerge is an auxiliary layer used in the process of merging two segments. 

All those alphabets contain a symbol # (blank) representing the absence of information. If u G A, 
note output(u), resp. comp(u), time(u). . .the projections on each layer (the result being # on [T| 
and |w| )• We have B C A up to the identification b i-> (6, jf). 

We shall detail the different alphabets in the following sections. As we will see, our construction 
needs interactions at a distance at most three, so we can take U_p = {—3, . . . , 3} as the neighbour- 
hood of F. 



3.2. Formatting the segments 

3.2.1. Bootstrapping 

If two symbols [JJ are separated by two cells or less, the rightmost one is destroyed. Otherwise, 
any [TJ symbol turns into a [w] , erasing the contents of three cells to its right and left (including 
walls), initializing on its left a computation and a time counter, and on its right a sweeping counter. 
No more [TJ or [w] symbols can be created. 

Walls, counters and computing areas created in this way are initialized, by opposition to those 
already present at time 0. Walls persist over time and are only destroyed under two circumstances: 

• when it is in a situation such that it is impossible that it is initialized (e.g. without a time 
counter to its left); 

• at time T n , if it is the left bound of a segment of length n. 

If a segment is of length three at time 0, then the time counter of the rightmost wall is erased at 
time 1 and the wall itself is destroyed at time 2. Thus segments have minimum length four from 
time 2 onwards. 

3.2.2. Counters 

All counters are binary in a redundant basis, so that they can be incremented by one at each step 
(keeping track of current time) in a local manner. 

Definition 6 (Redundant binary). Let u = u n -\ ...uqE{0, 1, 2}*. The value of u is 

n 

val{u) = Ui2 % . 
i=i 

Since the basis is redundant, different counters can have the same value. 
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Definition 7 (Incrementation). The incrementation operation inc : {0, 1, 2}* i— > {0, 1, 2}* is defined 
in the following way. If U\ u \—\ = 2, then \inc(u)\ = \u\ + 1, \u\ otherwise, and: 

1 si i = \u\ + 1 and = 2; 

inc(u)i = < Ui mod 2 + 1 if i = or Uj_i = 2; 
U{ mod 2 otherwise. 

Intuitively, the counter is increased by one at the rightmost bit and 2 behaves as a carry prop- 
agating along the counter. When the leftmost bit is a carry, the length of the counter is increased 
by one. Thus: 

Fact 1. val(inc(u)) = val(u) + 1. 

This operation is defined locally and can be seen as the local rule of a cellular automaton. 
3.2.3. Time 

We use the alphabet -4. t ime = {0, 1, 2, In a configuration, a time counter is a word of maximal 
length containing no # in the time layer. A time counter is attached if it is bounded on its right 
by a wall [w] , detached otherwise. 
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FIGURE 3. A detached time counter, and a time counter attached to an initialized 
wall. Only the layer «4.time is represented. ? cells have arbitrary values. 

At each step, attached counters are incremented by one while detached counters have their right- 
most bit deleted (see Figure [3]). Indeed, detached counters cannot be initialized and can be safely 
deleted. Formally, 

• if u\ = [w] , then time(F(u)o) = time(uo) mod 2 + 1; 

• if time(ui) = #, then time(F(u)o) = #; 

• otherwise, follow the incrementation definition (Definition [7} . 

When a counter increases in length, it can erase a wall. However, this is not a problem, as we 
shall see in Facts [2] and [6j 

Fact 2. An initialized wall cannot be erased by a detached time counter. 

Proof. A detached counter is not incremented and can extend by one cell at most because of the 
carries initially present in the word. But IT] symbols erase two cells to their right at initialization. □ 
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Fact 3. Each attached time counter u in F l (a) satisfies val(u) > t — 1, the equality being attained 
if this counter is attached to an initialized wall. 

Proof. No time counter is created except at t = 1 (by \T\). Therefore such a counter was present 
either in the initial configuration (with a nonnegative value), or was created at t = 1 by a [TJ symbol. 
It is incremented by one at each step in both cases. □ 

Thus we can use time counters to tell apart initialized walls from non-initialized walls, which will 
be the object of the next section. 

3.2.4. Sweeping and comparisons 

Sweeping counters are defined and incremented at each step in a similar way as time counters, but 
they have a range of different behaviors. The alphabet ^sweeping is decomposed into two layers 
Estate and lvalue- A sweeping counter is a word of maximal length of state different than The 
possible states of the counter are: 

"Go" state: The counter progresses at speed one to the right. 

"Stop" state: Once a wall is encountered, the counter progressively (right to left) stops. 
"Comparison" states: Once the whole counter has stopped, we locally compare the sweeping 
counter and the time counter, left to right, with a method we will describe later. 

The wall is destroyed if the sweeping counter is strictly younger, and the sweeping counter is de- 
stroyed otherwise (see Figures [5] and [6]) . In the former case, the counter progressively comes back 
to the "Go" state. 
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FIGURE 4. One initialized and one uninitialized sweeping counter. X symbols mark 
the cells where values are prevented to appear to avoid merging: the right counter 
is dominated. Only the layer ^sweeping is represented. 

Changing state takes some time to propagate the information along the counter. Therefore, coun- 
ters passing from a "Go" state to a "Stop" state are temporarily in a situation where the left part of 
the counter progresses whereas the right part does not. To avoid erasing information, counters in 
a "Go" state have buffers, i.e. the value of the counter is only written on half the cells, the other 
being erased when changing state (see Figured]). 
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When its length increase, a counter will never merge with another counter, instead erasing bits 
from the right-hand counter to avoid merging: we say the right-hand counter is dominated. Notice 
that it is impossible for a counter located at the right of another counter to be initialized, and so it 
is safe to erase bits of it. 

Fact 4. Any non- dominated sweeping counter u of F t {x) satisfies val(u) >t—l, the equality being 
attained if the counter is initialized (from a [Y\ symbol). 

Proof Similar to Fact [3l □ 

Thus, we guarantee that an initialized (hence non-dominated) sweeping counter is strictly younger 
than any non-initialized wall, and symmetrically. As for dominated counters, whose value is arbi- 
trary, we will see that they are erased before any comparison takes place. 

Definition 8 (Comparison method). Let u = uqU\ . . . and v = vqVi ... be two counters in re- 
dundant binary basis (adding zeroes so that |u| = \v\). Let us note sign(u — v) the result of the 
comparison between u and v, that is, +, or — . 

Case 1: if \u\ = \v\ = 1, sign{u — v ) = sign(uo — vq); 
Case 2: if uq + L n i/2j > vq + L^i/2j , then sign(u — v) = +, 
and symmetrically; 

Case 3: if uq + \ u\/2\ = v o + , then sign(u — v) = sign{u' l U2 ■ ■ ■ — v[v2 ■ ■ ■ ), 

where u\ = u\ mod 2 and v[ = Vi mod 2. 

In other words, we do a bit-by-bit comparison starting from the leftmost bit, considering that 
# = 0, and taking into account the carry propagation "in advance", so that the incrementation 
and carry propagation can continue during the comparison. If the result can be determined locally 
(cases 1 and 2), the state is changed to + or — , and it will propagate to the right along the counter. 
Otherwise (case 3), the state changes to =, which means future bit comparisons will decide the 
result in the same way (see Figure [6]). 
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FIGURE 5. A younger sweeping counter encountering an older wall. Only the state 
layer of «4 sweep i n g is represented, with greyed words for buffers. 
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FIGURE 6. The comparison process in detail. Here the sweeping counter is older 
than the wall and is destroyed. Only the layer ^sweeping is represented. 



After the comparison, two cases are possible: 

• if the state of the rightmost bit is — , the wall is destroyed and the state of the rightmost bit 
becomes Go. The counter then progressively resumes going forward. 

• if the state of the rightmost bit is + or =, it is erased. The remaining bits are progressively 
erased similarly to detached time counters. 

Notice that if the counter is dominated, then its leftmost bit is erased at each step, preventing the 
comparison to start, until the counter is entirely erased. 

Finally, the alphabet ^sweeping can be written as {#} U {Go} x {0,1, 2, U {Stop, +, — , 0} x 
{0,1,2}. 

When a sweeping counter reaches the right wall of the segment, the segment is said to be swept. 
This implies no (non-initialized) walls remain in the segment. 

Fact 5. At time k(l + [log A;]), all segments of length k are swept. 

Proof. As long as t < k(l + [log A;]), any initialized sweeping counter has less than 2 [log A;] cells 
containing a value. The counter progresses at speed one except when it meets another wall. Each 
comparison takes a time equal to twice the current length of the counter. Furthemore, two consec- 
utive walls are separated by three cells at least (cf. Section f 3 . 2 . 1 1) . Thus, the segment is swept in 
less than k + | • 2 • 2 [log Af| steps, and we can check that t < k(l + [log ft] ) . □ 

Fact 6. An initialized wall cannot be erased by a time counter attached to a non-initialized wall. 

Proof. Consider two walls separated by k > 3 cells, the left being initialized and the right non- 
initialized. The value of the time counter attached to the right wall cannot exceed 2 k ~ 3 at t = 1 
(since [TJ erases three cells to its right), it will take more than 2 k — 2 fc_3 steps before the left wall is 
erased. According to Fact [5j the right wall will be destroyed in less than k(l + [log A;]) steps, and 
the time counter will take at most k more steps to be erased. 

For k > 5, A;(l + log A;) + k < 2 k — 2 fc ~ 3 , so the counter is erased before it reaches the left 
wall. For k = 4, there cannot be another wall between them, so the destruction time is actually 
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< k + 2 log k + k > 2 k — 2 k 3 . For k = 3, the time counter of the right wall is erased at t = 1 by 
the left wall, so the right wall is immediately destroyed. □ 



3.3. Computation and copy 
3.3.1. Computation 

Computation takes place in the area delimited by the time counter attached to the right wall. -4 C omp 
is divided into three layers, on which three Turing machines = {Qi, Tj, q$ , Si, QfJ are 

simulated. We use the alphabet A CO mp = ®iLi x {Qi U #)• For i = 1, 2, 3, compi is the projection 
on the corresponding layer, and we identify ((#, (#, (#, #)) with #. 

On each layer, the left part contains the content of the tape, and the right part contains the 
state of the machine in exactly one cell (to represent the head), and # everywhere else. If a cell is 
outside a time counter, the computation layer is deleted. The evolution of the tapes is governed by 
the rules of the machine, i.e.: 

• if time(uo) = # then compi{F(u)o) = 

• if the head is on no and 5(compi(u)) = (9,7, •), then compi(F(u)o) = (9,7); 

• if the head is on u\, 5(compi{ui)) = {q, 7, «— ) and no = (#,7'), then compi{F(u)i) = (#,7) 
and compi(F(u) ) = (g,V); 

• similarly if the head is on n_i and <5(compj(u_i)) = (g, 7, — >■); 

• otherwise, compi(F(u)o) = compi(uo). 

We now describe the operations expected to be performed during the time interval [T n _i,T n ]. 
Sometimes a machine can read its input or write its output on another layer. Suppose that layer 1 
receives n as input. The machines: 

• replace n by n + 1 on layer 1; 

• compute w n on layer 2, outputting it on the output layer; 

• compute T n on layer 3; 

• when t = T n (t being written on the time layer), it initializes the next computation and the 
copy, or the merging with its right segment. 

Copy and merging are detailed in next subsections. 

All these operations must be performed in less than T n — T n _i steps, and the machine can only 
use the space used by the time counter. 

First we suppose that each w n can be computed in space y/n, defining if necessary a new sequence 
where each w n is repeated as long as there is not enough space to compute u> n +i. Now fix T n — T n _i = 
qlV^l _ where q is such that Card ^2)^ x ^/n x Card(Q2) where T2 and Q2 correspond to the Turing 
machine of layer 2. In this way, between T ra _i and T n the Turing machine of layer 2 has necessarily 
enough time to compute w n in space yfn. 

Moreover, at time T n _i the time counter is longer than log 3 (T n _i) > log 3 (q^ n ~ 1 ) (the counter 
uses three letters) and this longer than y/n for q > 5. For layers 1 and 3, the time and space bounds 
are verified asymptotically, i.e. there exists machines satisfying these bounds for n > N. Let tjy be 
the maximal time necessary for those machines to perform those operations when n < N; we can fix 
T n+ \ — T n =tN when n < N, which has no influence on the asymptotic behavior of T n and ensures 
that the machines satisfies the time bound for any n. For the space bound, it is always possible to 
compress the space by a constant factor (by grouping tape cells) so that the space bound is satisfied 
for n < N, with no impact on the computing. 
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3.3.2. Copying 

At time T n+ \ (n > 0), w n has been output on the output layer. If the segment is not merging with 
its right segment, the Turing machine triggers the copying process by copying the first letter of w n 
from the output layer to the copy layer. 

First phase: Inside the time counter, the word on the copy layer progresses at speed -2, and 
a letter at each step is copied from the output layer to the tail of the word; 

Second phase: When the head is out of the time counter, the word keeps progressing at 
speed -2 but the head loses one letter at each step and copies it on the output layer. The 
tail keeps copying letters from the output layer. 

Intuitively, the cellular automaton performs a catterpillar-like movement between the copy and 
output layers (see Figure [7] for an example) . The process ends when it meets a wall or a sweeping 
counter to its left. Thus, w4 copy = B U {#}. 
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FIGURE 7. Beginning of the copying process, with w n = 1101. Only the layers A c , 
and ^.output are represented. The thick line is the limit of the time counter. 



3.4. Merging of segments 

At time T n , all segments of length n merge with their left neighbor, so that the density of walls will 
tend to 0. 

To do so, at time T n —\ is initialized a merging counter of value 2n on the merge layer, decre- 
menting at each step. Since the value of n is written on one computing layer, it is copied (with 
an additional 0) on the merge layer using an auxiliary state [c] (copy). This counter is similar to 
incrementing counters, except it uses -1 as negative carry. 

Simultaneously, a signal is sent to the right and bounces off the right wall. Thus, the segment is 
of length n iff the signal returns before the end of the decrementation. 
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In this case, a symbol \m\ (merge) is created on the merge layer, to indicate the wall will be 
destroyed at next T n ; otherwise, the output will be copied in the output layer as described above. 
Thus Anerge = { — 1,0, 1, |m| , [c] } X {— >, <—} U {#}, see Figure [8] for an example of this process. 
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FIGURE 8. Determination of the length of the segment. Here the right segment is 
of length 3 and the wall merges at time T3. The counter of the right segment has 
been omitted for clarity. 



Fact 7. All left walls of segments of length k are erased simultaneously at time min(Tfc,2 fc + k). 

Proof. Except for the situation described above, the only other way for an initialized wall to be 
erased is a time counter attached to an initialized wall, see Facts [2] and [6j A redundant binary 
counter whose initial value is reaches length k at time 2 k + k. 

□ 

We will consider from now on that n is large enough so that 2 n + n > T n . 
Definition 9. Denote 

r* k = jx G At 1 : [0, 1] is included in a segment of F t (x) of length /c j 

rU = U r M and rt i= r l>i 

i>k 

Proposition 6. Let \i G M^- era {A z ). For all l,keN, one has ^{T\ >k ) — > 0. 

Proof. When t > T n , no configuration can contain a segment smaller than n. Moreover, by a- 
ergodicity, since \i (j_\T]\ Die [in] ° % ([IXI])) ^ 0> there exists //-almost surely segments of length 
larger that n at t = and those segments survive up to time T n . 
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Therefore, the cell is ^-almost surely included in a segment at time T n . Since this segment is 
longer than n and by <r-invariance, the probability that [0, 1] crosses a border of the segment tends 
to as n tends to infinity. □ 



3.5. Correctness of the cellular automaton 
3.5.1. Acceptable segments 

Proposition 7. T n = 0(Lv^j9 ) where q is defined in Section \3. 3.1\ 

Proof. T n = YZ=i T k ~ T k -i- Since asymptotically T k+1 - T k = gLv^J ; an d : 

Lv^J-l n Lv^J 

(2[^n~\ -l) g Lv^J-i < (2k+l)q k < Y^l 1 ^ < ^2@k + l)q k < (2[VnJ + l)q^ +l . 

k=l k=l k=l 

the proposition follows. □ 

In Proposition [6j we did not control the length of the segment at time t. In order to approximate 
a measure, we want to ensure the copy is finished before the time T n+ \, which requires that the 
segment is not too large. 

Definition 10. Let x 6 A z , [i,j] a segment at time t 6 [T n ,T n+ \]. It is acceptable if j — i — 1 < 

K n = y/T n+ i - T n . For n large enough K n = q 2 . 

Proposition 8. For t large enough, an acceptable segment is swept. 

Proof. When T n < t < T n +i, for an acceptable segment of length k, we have k(l + log A;) < 
K n (l + log(A" n )) = o(T n ) by Proposition Taking n large enough, we conclude by Fact [53 □ 

Proposition 9. Let fj, € Ai^ mix {A^). One has ^,{Tjy K ^) — > 0, that is to say: 
n{{x G A^ : [0,1] is in an acceptable segment of F t (x)}) — > 1 

t— >co 

and the rate of convergence is exponential. 

Proof. Any segment at time T n corresponds, at time T n —i, to a segment of arbitrary size plus an 
arbitrary number of segments of size n. For I < n, define 

A* na = {x £ A Z : starting from j there is a strip of a consecutive segments of size n in F t (x)}. 

Take any L > In, and depending on whether there are more or less than [— J such segments of size 
n at time T n _i (see Figure [9] for an illustration of this decomposition), one has: 

fc-1 



i=-L+l j=0 

^) * ( r ?>*) ^ L ^ ( r &- l) + to ( A 54!J ) 
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Thus we try to bound the value of /x(Ag na )- If x G Aq n then for all i E [0, a] one has Xi n = \Y\ 
(corresponding to initialized walls at time t). For any m > 0, by considering one symbol out of m: 



(2) 



M (A^) < m H <7 * n ( [00] ) 

\ie[0,a] 
/ 



n ^ m ([H]) 

<(l + ^(mn))^V([0])^ J+1 



time 



m • n 



< » 
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FIGURE 9. Illustration of the proof of Proposition [9] with a = 9 and m = 3. 



Now take any M > n. Using ([2]) with m = [— ] inside equation (pQ): 



1 + [ n ■ 



M 



n 



< (rJ|- fc L £ ) +*[(! + ^(M))/i ( [m])] 



Now, if fc > nL, we obtain by induction: 

(3) M(r^ fc ) <^fe_ ni )+tt n [(i + ^(M)) M ([0])] 

For the left-hand term, we have: 



m (r?,>*-*iXaO) < m U ^([H 

\j e[— fc+nL,— 1] ie[0,fc-nL] 

U n 



CT 



y e [-fc+nL- 1] ie [ ,L*^p. 



< (A; - nL)(l + ^(n))^H/i ( [[J] 



(urn 
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the second line being obtained by considering one symbol out of every n. Putting M = n, L = n 2 \/ri, 
and k = K n in ([3]) then since Vv( n ) ~~ ^ 0> we have fi(TT n K ) — y and the rate of convergence is 
exponential. □ 

Remark 2. Remark that it is possible to take any value for K n as soon as K n = u}(n 2 ^/n). 
3.5.2. Density of auxiliary states 

Proposition 10. Let fj, € M f ^!! erg {A z ) and u G B l for some fixed I. For a given segment length k 
such that n + 1 < k < K n one has: 

• Ifte [T n + k,T n+1 ], 



/x(F-*(H)|rg)-^ n (H) 



o 



Ift€. [T n , T n + k] one has 



l,k 



w 
w 
w 
w 
w 
w 
w 
w 
w 

FIGURE 10. Illustration of Proposition [TUJ The output is not correctly written in 
dashed areas because of the destruction of a wall. 

Proof. We write rj^" +fc j = {i£ A z \ [i,i + k] is a segment at time T n }, so that 

-l -l 
r S= U r & +fc+ i]= U ^hl,*]) (disjoint union). 

i=—k J r l i=—k-\-l 

Suppose x S P[^x £]• Since such a segment is acceptable, it is swept, and any non-initialized counter 
or wall has been destroyed. Since \w n \ < y/n (smaller than the computing space), the copying 
process will use less than y/n auxiliary cells. 

First point:The copying process progresses at speed one, so at time T n + k the copy of the word is 
finished (since T n + k < T n+ i), and the segment is constituted only by copies of w n except for the 
time counter and computation area (0(\/n) cells) and a merging signal (one cell). 
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Therefore for all x E r7™, ,,, one has 

[-l,k]' 



Card({»e[0,fc-l]:.F t (x)e[«] j }) n 



Q(v^) _ n fj-^l 



taking into account the time counter, the merging signal, and the last copy of w n in the segment 
which can be incomplete, and since k > n. Thus we have 

rE^O^MO I T f-iM) -C(M) =o 



i=0 



Since [i is a-invariant, fi (F t ([u] i ) \ ^f\ k ]) = M ( F *([u]o) | T^. +fc+1] J. So: 



-1 



MHHo) I r 



i=-k+l 



b E " ( F ^(No) I r|« +fe+1] 



by (T-invariance and disjoint union of rjjj. The result follows. 

Second point:When t £ [T n ,T n + k], the copy is still taking place, with t — T n cells containing 
copies of w n and the rest containing copies of w n -i, except for: the computation part, the copy 
auxiliary states, the merging signal, and possibly defects when a wall has been destroyed at time 
T n (there are at most ^ of them) . Therefore 



#{i € [0, k - 1] : F\x) € H} fk -(t-T n ) ~ t-T n ^ n 

— j, Vi(M) + — k(N) 



k 



\0(^/n) ■ ~ = O smce k > n. Using the same reasoning as the first point, we conclude. 



□ 



3.5.3. Proof of Theorem Q]- first point 

Let fi 6 A4^ mix (A z ) and u 6 £> i+1 . Since at time T n there are no segment of length less that n, and 
by Propositions [6] and [9J one has max|/i ^Un<fc<x n ^IfcJ '■ F n < t < T n +i| — )• 1 exponentially 



fast. Therefore: 



and T lk = T, ? since no segment is destroyed between T n and T n +i. By Proposition [TO 
*&([«]) - EM^) (max (o, tdt^Tj C(N) 



max 

T n <i<T n +i 



+ mi ' J ' fc T " ' ' >,r,; 



max 

T„<t<T n +\ 



FtKH) - {fn(t)5 Wn ([u]) + (1 - / n (i))<W-i(M) 
where / n is the piecewise affine function defined by 
/„: [T n ,T n+l ] — > [0,1] 
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k-(t-T n ) 



The second term is chosen so that f n {T n ) = and f n {T n+ i) = 1, but it converges to exponentially 
and thus does not affect the equation. Therefore 



T„<t<T n _|_i 

Since / n is — Lipschitz on [T n ,T n+ i], we deduce that 



max 



d M (u, {F^ \T n <t< T n+l }) = O (-j=\ 



We conclude that V(F,/j,) = V((uvJneN)- 

When w n is computable in space y/n, by Proposition [7] we find that the rate of convergence is 
dM {Fl(i,V ({w n ) ne x)) < O f l g(t) J +SUp | dM : v e U [<W><Wl 

^ n>n(t) 

where n(t) = 0(log(t) 2 ). We recall that w n can always be computed in space \fn by repeating 
elements. 

3.5.4. Proof of Theorem Q]- second point 

Assume that V((wj)igN) = an d let F be the cellular automaton associated with this sequence 
described above. Consider fi G ■M^~: erg (A ) and u G £>' +1 . Since is not assumed to be ^-mixing, 
Proposition [9] does not apply, and there is no guarantee most segments are acceptable. However, 
since fi is ergodic, so is F^fi for all t, and n{F\ >k ) — > 0. 

'— k— >oo 

CLAIM 1: fi(F t (x)o G ^4\^) — > 0, i.e., the density of auxiliary states tends to 0. 

t—>co 

Proof. Suppose we are in an initial segment of length k. Detached time counters, Turing 
machines and merging counters initially present are destroyed in less than k steps. Similarly, 
left merging signals and copy auxiliary states initially present progress at speed -1, so they are 
destroyed before time k. An uninitialized wall is destroyed after k(l + logfc) steps at most, 
and any counter attached to it are destroyed after less than k more steps. For all those states, 
the probability of apparition after time t = k(2 + log k) is less than ^t(r5, fc ) — > 0. 

— k— t>oo 

At time T n , all segments are longer than n, so the density of walls is less than -. Further- 
more, the density of auxiliary states that have been generated by an initialized wall inside 

each segment is O (^~^J ■ 

Only uninitialized sweeping counters and right merging signals remain. Inside each segment, 
call non-swept area the interval between the initialized sweeping counter of the left wall and 
the rightmost cell containing one of those two states. At each step, this area loses one cell to 
its right but may grow by one cell to its left. Notice that merging with other segments cannot 
increase this area since segments of length n at time T n are swept (see Figure [TT]) . 

At time T n , a segment can contain a non-swept area longer than s/n only if it is issued 
from a segment longer than \fn initially, and the non-swept area of other segments have 
a density smaller than -7=. By <r-invariance, fi({x G A z \ xq is in a non-swept area}) < 

4= + /i(r° r ) — > 0. 



Therefore, for o G A\B, we have F*fi([a])} — > 0. 

t— >co 
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O Claim 1 




FIGURE 11. Illustration of the last part of the proof of Claim 1. Slanted lines are 
sweeping counters and grey areas are non-swept. 



time 




k 



FIGURE 12. Illustration of Claim 2. When t > T n + k, the segment is a succession 
of stripes containing w n , w n +x, . . . plus a negligible part of auxiliary states and de- 
fects. 

Claim 2: For any n G N, we have for t large enough dj^i (F*fj,,Conv((wi)i> n )) — > 0, where 

~ t—^oo 

Conv(X) is the convex hull of the set X. 

Proof. Consider a segment of length k at time T n . At time T n + k the copying process for w n 
will be finished, but since the segment is not necessarily acceptable, other copying processes 
may have started in the same segment. Therefore, the segment will be constituted by: 

• a negligible number of auxiliary states; 

• strips containing repeated copies of w n , then w n +i, w n +2- ■■ separated by ongoing copy 
processes (the number of auxiliary copy states being negligible). See Figure [121 

Since the density of auxiliary states tends to 0, and /u(r, — > 0, for all e > it is 

possible to take k large enough so that dj\4 (Fj ,n+k /j,,Conv((wi)i> n )) < e. O ciaim 2 
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The second point of the Theorem Q] follows easily from Claim 2. 



The use of a convex hull gives the intuition that, in order to reach a set of limit measures with 
this construction (instead of a single measure), we have to control the length of the segments, which 
requires ^-mixing; hence we do not hope to extend this proof without modifying the construction 
in depth. 

4. Related problems solved with this construction 

In this section, we use the construction developed in Theorem Q] in view to solve natural problems 
concerning accumulation point of the iteration of a cellular automaton on an initial measure. 

4.1. Characterization of the set of limit points 

4.1.1. The connected case 

Reciprocals of the computable obstructions described in Section [2] follow directly from Theorem [TJ 

Corollary 1. Let v G M. r a comp (B^) be a recursively computable measure. There is an alphabet 
A D B and a cellular automaton F : A z —¥ A 1, such that for any G A4^ U erg (A z ) , one has 
limn-^N F* A* = v ■ 

If we furthermore suppose that the initial measure [i is computable, this is a full characterization 
of reachable measures v. 

Proof. Combine Proposition [JJ with the first point of Theorem [TJ □ 

Corollary 2. Let V C A4 a (B 1 ') be a compact, £2- computable and connected (E^-CCC) subset of 
MfjiB^. There is an alphabet A D B and a cellular automaton F : A 1, — > A z such that for any 
H G M f f mix (A Z ), one has V(F, M ) = V. 

If we furthermore suppose that the initial measure \x is computable, this is a characterization of 
reachable connected subsets V. 

Proof. Combine Proposition [5] with Theorem [TJ □ 
Open question 1. Is it possible to improve the speed of convergence? 

4.1.2. Towards the non-connected case 

In Corollary [2] it is assumed that the set is connected. It is due to the fact that in the construction of 
Theorem [TJ the words (u> n ) ng N are copied progressively and not instantaneously on each segment, 
so that we get the closure of an infinite polygonal path, which is connected. However, we get 
topological obstructions even if we consider a non-connected set of limit points. For example, if 
V(F, n) is finite one has the following proposition. 

Proposition 11. Let F : A z —> A z be a cellular automaton and fi G M. a (A ) such that V(F,fi) is 
finite. Then F* induces a cycle on V{F,n). 

Proof. Let d = min{<i J v!(^, z/) : u, v' G V{F,jj) with v 7^ z/} and consider v G V(F, fj,). It is 
possible to extract a sequence (n^jgN such that djuiF* 1 < | and ci_M(i ? ™ ,+ \x,v) > Since 
d M (F?P,V(F,n)) — ► 0, we have d M (F^fx,u) — »• 0. By continuity of F„ d M (F" i+1 /x, F*v) — ► 

n— >oo i— >cxd i— >oo 

0. 

One deduces that for all v G V(i ? , \i) there exists v 1 G V(F,fi) such that F^v = v' . So there is 
k G N such that V(F, (j) = {vq, . . . , Vk—l) an d F*Vi = h>i+\ where the addition is modulo k. □ 
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We exhibit some examples of more sophisticated behaviors based on the construction in The- 
orem [TJ The first one is a family of cellular automata where V(F, /i) is a finite set of connected 
components, which is a partial reciprocal of Proposition [TTJ The second one is a family of cellular 
automata where V(i ? , /x) has an infinite number of connected components. However these are not 
total characterizations of the possible set of limit points. 

Example 1 (Finite set of connected components). Suppose V = {uq, . . . ,i/k-i} C A4 a {B z ) is a 
finite set of u-invariant recursively computable measures such that Gvi = Vi+\ for some periodic 
cellular automaton G : B z — > B z . Then there is an alphabet A D B and a cellular automaton 
F : A z ->■ A z such that V(F,/jl) = V for fi £ M a A ^ eTg (A z ). Indeed, if F is the cellular automaton 
which realizes vq obtained by Theorem [TJ consider the cellular automaton that applies G on the 
output layer and applies the local rule of F once every k steps if an auxiliary state appears. 

The same idea holds if V is a finite union of E2-CCC sets which are mapped by a periodic cellular 
automaton G : B z — > B z . 

Example 2 (Infinite set of connected components). We give some informal elements to modify the 
construction of TheoremJJto get examples of cellular automata where V(F, fj,) has an infinite number 
of connected components. The construction uses the firing squad cellular automaton (Bfs,Ffs) 
which has the following properties: there exists four states { <~F\ , |_ J > !__' > \0\ } ^ ^fs such 
that if X[q jTl ] = Q !__! " 1 | then i ? FS l ( 2; )[o,n] = '^F\ ™ + an d t ne state <~F\ does not appear in 
( f fs ) (ij)e[o,n] x [0,2n-l] |Maz96j . 

Consider a computable family (Vj)jgN of S2-CCC subsets of A4 a (B 1 ') and assume that ViDVj = 
for all i,j € N. There is a computable sequence of words (ifn)neN such that the limit points of this 
sequence are exactly V = UieN^- Consider the cellular automaton (A^,F) given by Theorem Q] 
which produces V((w' n ) n& n) where w' n = w n x | ] with A D B x B^s- We modify F to obtain 
F in the following way: 

• at time T n , when the copy of w n is initiated, we initialize a counter on another layer to 
count the length k of the segment; 

• at time t = T n+ \ — 2k, the state £ appears on the left border of the segments (remember 
that the time counter keeps track of time) ; 

• All <~F\ symbols are immediatly transformed into | | symbols. 

This requires the segments to be short enough, but the probability that [0, 1] belongs to such a 
segment tends to 1 as time tends to infinity (see Remark [2]). In those segments, F*(jl approximates 
the measure 5 Wn x S _ at time T n+ i and the measure 5 Wn x 5 — at time T n+ \ + 1. The state 'p\ 
appears only at times (T n ) ng N- 

For an initial measure \x £ A^^ u mix (^l z ), one has V(F, fj,) = Vx6~~^~UV' with V C M a ((#fs \ { [F\ }) 
which means it has an infinite number of connected components. 

Open question 2. Is it possible to characterize all compact subsets of A4 a (A z ) that can be reached 
as sets of limit points of the iteration of some cellular automaton on a computable initial measure? 

4.2. Cesaro mean 

In this section, by adapting the enumeration (w n ), we are able to get some control over the set 
V'(F,(J,) of limit points for the Cesaro mean sequence. 

Corollary 3. Let B be a finite alphabet and V C A4(B Z ) a S2-CCC set. There exists an alphabet 
A D B, and a cellular automaton F : A — )• A such that for any fi E M^ U mix {A 1 '), one has 
ViF.f) V. 
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Remind that the latter is the set of limit points of the sequence ipf (fi) = ^rj Si=o-^*A*- "^'(-^A 4 ) 
is necessarily connected (because M(A Z ) is metric and compact), and if we suppose that the initial 
measure [i is computable, we obtain a full characterization of reachable subsets V' . 

This corollary is a consequence of the following stronger result, where we have control over both 
V(F,/i) and V'(F,fi). 

Corollary 4. Let B be a finite alphabet and V' C V C A4(B Z ) two S2-CCC sets. There exists an 
alphabet AD B, and a cellular automaton F : A — > A such that for any fi E A4^ U jnix (A 1 '), one has 

. V{F,n)=V; 
• V[F,n) = V. 

V'(F, jj) is necessarily included in the convex hull of V(F, fj,). Here we need a stronger hypothesis, 
namely, that it is included in V(F,fj,). Therefore, if we suppose the initial measure is computable, 
this is a characterization of reachable pairs of connected subsets (V, V') such that V' C V. 

Proof. We will use notations from the proof of Proposition [SJ Notably (w n )neN an d (w' n )neN are 
the computable sequences of words associated to V and V', respectively, and V^, = {1 6 : 3T < 
t, ft(x) < 2ctk}, where f t is a computable sequence satisfying dy = liminf f t . Without loss of gen- 
erality, suppose that max(|tt; n |, \ w' n \) < y/n for all n (repeating some words if necessary). 

We will define a new sequence of words (u;") ng pj in the following manner, using a similar method 
as Proposition [5j For i 6 N, let be the maximal value such that IJreV* ^( r > a «i) nas a P a th- 

connected component containing 5 w /,5 Wi and <W +1 - Let Pi : [0,pi] — > V„. be a path in V^. such 
that 

- d M (5 Pi ( k ),5p i{k+ i)) < a TH for all k G [Q,Pi - 1]; 

- dM(<5 Pi ( ), <5 W /) < a ni , d M (5 Pi ( Pi ),5 w / i+i ) < a ni ; 

- there exists k G [0,Pi] such that dj^i^dp^), S Wi ) < a ni . 

Since there are less than \A\ ni elements in V^., this path is of length |Pj| < 2\A\ n > < 2\A\\ W ^ < 2\A\\ 

For n G [\Af , |-4| (m)2 ], we define: 

- if n< \Af + \F\, w^ = P t (n-\Af)- 

- otherwise, w'^ = w' i+l . 

and let F be the CA defined as in Theorem [TJ Since all elements of w n are enumerated as in 
Proposition [5l we have V(F,n) = V(u>") = V. 

A B C 



w'i-i T \ A f T \Af+\Pi\ w * T \A\^ 2 

Figure 13. Intuitively, we prove A + B < C, then B < A. 

We have 

|^|(m) 2 _ (|^ 2 + |p.|) \A\^+ 1 ) 2 - \Af +1 i^oo " 
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In other words, the subset [0, \A\ l + \P%\] is (asymptotically) of negligible density in [0, |»4.|( i+1 ) ]. 
Since T n+ \ — T n is an increasing sequence, the subset [OjTj^.^Mp.i] is of negligible density in 

[0,Tj_ 4 | (l+1)2 ]. This means that, putting U = T^ {l+1) 2 , d(<p(.(n), ) . — ► °- 

Furthermore, notice that for x,y G R+, when y < y/x, we have [-^/x + y\ < \_-\fx\ + 1 and 

2 A* 2 +\Pi ^ 1 1 ' 



T \Af> T \Af- T , Af _^>^~ 1 -\^ 
where q is defined in Section [3,3,11 and therefore 

T \Af+\P t \ ~ T \Af ) Q 
I i^| i2 +|F i | 

This means that, when i' = T, AH 2 , , p h d((pE(u), S w /) — > 0. 

The Cesaro mean sequence <pf(fM) is (asymptotically) close to 5 W / between times ij and t^, and 
is close to S w i at time ij+i. Therefore, it is close to the segment [5 w i, 5 W '. +1 ] between times ij and 
U+l- We conclude that asymptotically, the sequence is close to V((w' n )), and thus its set of limit 
points is V'. □ 

Open question 3. Is it possible to extend Corollary^ when V' is not included in V? 
Using Example Q] we can only provide some examples where V([F, fi) n V'(F,fj,) = 0. 

4.3. Decidability problem 

We give an undecidability result extending a result of Delacourt on //-limit sets |Delll| . 

Corollary 5 (Rice theorem on sets of limit measures). Let P be a property on set of measures that 
is nontrivial on non-empty Y^i-CCC sets (i.e. not always or never true). Then it is undecidable, 
given a CA F : B 1 -»• B 1 , whether V{F,n) satisfies P for \i G M^ m JB z ). 

Proof. We proceed by reduction to the halting problem. Since P is nontrivial, let Vi and V2 be 
two S2-CCC sets that satisfies and does not satisfy P, respectively. By Proposition there exists 
two computable sequences of words (u) n )neN, (^4)neN £ (-4*) N such that Vi = V((w n ) n ^), V2 = 
V(«)„ eN ). 

Now let 7~.M be a Turing machine. Define the sequence (tu^) ng jsj in the following way: 
• If TM halts on the empty input in n steps, u>" = w r , 



-'n ■ 



Otherwise, = w' n . 



This sequence is computable by simulating n steps of the Turing machine and computing the 
corresponding sequence. Therefore, we can use the previous construction to build a CA F such 
that V(F,n) = V((u^)neN)' If TM halts on the empty input, then = w n for n large enough; 
otherwise, w'^ = w' n for n large enough. Thus, V(F, fx) satisfies P if and only if TM halts. □ 

The same reasoning holds for a single limit and the Cesaro mean sequence. 

Corollary 6 (Rice theorem for one limit measure). Let P be a property on measures that is nontrivial 
on recursively computable measures. Then it is undecidable, given a CA F : B z — > B^, whether 
— > v where v satisfies P for fj, G Ai^ U mix {B 1 ') . 
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Corollary 7 (Rice theorem on Cesaro mean). Let P be a property on set of measures that is 
nontrivial on non-empty H^-CCC sets. Then it is undecidable, given a CA F : B z — > B z , whether 
V'(F, fx) satisfies P for » G M f ^ mic (B z ) . 

4.4. Computation on the set of measures 

In this section the construction developed in section [3] is modified to perform computation on the 
space of probability measures. In other words, we want the set of limit points to be a function of 
the initial measure. 

4.4.1. Computation with oracle 

The definitions of Section [2] can be adapted to compute with an oracle fi G Ai a (A z ). 

A Turing machine with oracle in Ai C Ai a {A z ) has the same behavior as a classical Turing 
machine but it takes initially in argument a measure fi G Ai as oracle. During the computation it 
can query the oracle by writing u G A* and n 6 N on an additional oracle tape and entering a 
special oracle state. Then the oracle writes an approximation of with an error 2~ n and the 

computation continues. 

Let Ai C Ai a (A 1 ') and X,Y two enumerable sets. A function / : Ai x X — > Y is computable 
with oracle in Ai if there exists a Turing machine with oracle in Ai which takes as input x G X 
and returns y = f(/j,,x) G Y, up to reasonable encoding. 

Definition 11. Let Ai C Ai a (A z ). 

A function tp : Ai — > Ai a (B z ) is computable with oracle in Ai if there exists / :MxN — > 
B* a computable function with oracle in Ai such that \<p{fi) — 5/f^ )Tl )| < 2~ n . This is an extension 
of the previous definition where the image is not countable, hence the abuse of notation. 

A function ip : Ai — > M. (T {B 1 ') is recursively computable with oracle in M. if there exists 

/ : Ai x N — > B* a computable function with oracle in Ai such that Sf( un ) — > y(ii). 

A sequence of functions (f n : Ai x Ai a (A z ) — > M) n eN is a computable sequence of functions 
with oracle in Ai if 

• there exists a : Ai xNxNx4* — > Q computable with oracle in Ai such that / n (/i, S w ) — a(/x, n, m, w) < 

i for all [i G M, w G A* and n, m G N; 

• there exists 6 : Ai x N — >• Q computable with oracle in Ai such that dpiiy^i/) < b(fj,,m) 
implies |/ n (/U, ^) — / n (^> — m f° r all G .M and n, m G N. 

Let ^ be a set of compact subsets of Ai a {B' L ). A function V : — > ^ is S2 _ computable with 
oracle in Ai if there exists a computable sequence of functions (f n : Ai x Ai a (B z ) — > M) n eN with 
oracle in Ai such that dyt^iy) = limin^^oo for all v G .Mo-(£? z ). 

The proofs of Section [2] can be easily adapted in this framework. For any cellular automaton F 
on A z , one has: 

• following Proposition [3l the function fi i — )• i^/x is computable with oracle in Ai a (A z ); 

• following Proposition HI fi i — ?■ V(F,fi) and /i i — ^ V'(F,fi) are ^-computable with oracle in 

• following Proposition [5J if V : — > ^ is a S2-computable function with oracle in Ai and if 
every element of K is connected, then there exists a computable function / : Ai X N — ► .4* 



with oracle in such that V(fi) = V((/(/i, n)) neN ) = nw>oUn>iV &f(ji,n),Sf<jt,n+i) ■ 
4.4.2. Towards a reciprocal 

In this section, we give a partial reciprocal to the last fact. To use the initial measure fi G Ai a (A z ) 
as an oracle, we need to keep some information from the initial configuration. We adapt the original 
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construction in the following way: 



Each segment keeps a sample of the initial configuration, using the frequency of patterns this 
sample as an oracle in the computation. We need to ensure that the frequency of a pattern u G A k in 
this sample is close to //([«]) with a high probability. For n G Ai^ mix (A z ) we have an exponential 
rate of convergence for every length (Theorem III. 1.7 of |Shi96j). More precisely: 

/ f y ]\ n fU \ Card(A) fc nc£ 2 

H l<j x G A : max{|/i([n]) - Freq(«, X[ 0>n ])| > e} > j < (k + m)ip(m)k + lj 2 « , 

where m G N, c > and Freq(u, X[o jn ]) = ^yCard (i G [0,n] : x^j+^^a.] = u) is the frequency of 
u in the pattern X[o, n ]- 

However, in our case, not all the information in the initial configuration can be kept since sweeping 
destroys information in the segment. Therefore, we will keep partial information from 7T/U, where 
7r : A z — > {0, 1} Z is a 1-block map such that 7r ([Y]) = 1 and 7r(a) = otherwise. Even then, 
remembering the frequencies of all words takes too much space, so we just keep information about 
words of a fixed length. Here, each segment will keep in memory only the density of one letter, but 
it is possible to adapt it to keep information on longer words. 

Theorem 2. Let V : -A"!^ ^ ({0, 1} Z ) — > be a ^-computable function where 8. is a set of 
compact connected subsets of A4 a (B z ). Assume that V(/i) = V(fi') if = ^'([1]) f or H,^' G 

There exists a cellular automaton (A Z ,F) such that V(F,fi) = V(7r/i) for all /x G A4 a - m i x (A z ) 
where it : A 1 * — > {0, 1} Z is the 1-block map defined above. 

Notice that since only one density is considered, it would be equivalent to consider a In- 
computable function M — > A. 

Proof. Let / : Ai x N — > A* be a computable function with oracle in M such that V(/i) = 
V((f(n, n)) ne psj) and consider the associated Turing machine with oracle. 

Consider the cellular automaton defined in the proof of Theorem [TJ We add a new layer -4, racie 
in which each segment at time t stores the frequency of the state \Y\ in this segment at time 0. To 
do that, we modify the construction in the following way: 

• We subdivide the layer ^. racie in two parts, on which each wall [w] keeps on its left: 

— the first counter for the number of \Y\ symbols that have been destroyed in its left 
segment; 

— the second counter for its length, worth if the segment is not swept. 

• Another counter accompanies each sweeping counter, measuring the length of the segment 
as it progresses. 

• The second counter is initialized as 0. When the time counter attached to this wall makes 
a comparison with an initialized sweeping counter (the comparison returns the result "="), 
the second counter stores the length of the segment. It may take the value again after 
merging with a non-swept segment (see below). 

• When a wall is destroyed by a merging process, it sends to its right a signal at speed 1 
containing all the stored information. Such a signal should not cross a sweeping counter, so 
it is slowed down if necessary. 

• When a wall has stored (ci,C2) as oracle and receives the signal (c'^c^) from its left, there 
are three cases: 

— If 02 = 0, the left segment was not swept, the signal cannot come from an initialized 
wall and can be safely ignored. The oracle remains (ci, C2). 

31 



— If C2 7^ 0, the information comes from an initialized wall. Put c" = c\ + + 1 to take 
the merging into account. If c' 2 = 0, the segment just merged with a non swept segment 
and c'2 = 0; otherwise d 2 = C2 + c' 2 - The new oracle is (c'/, c^). 
See Figure [J5J We remark that if the length of the segment is k, the information can be 
coded in space log(fc), and it is possible to actualize the values before another signal can 
come from the left. 



time 




k n 



FIGURE 14. Each wall has its counter displayed when its value changes. Slanted 
thick lines are sweeping counters, dotted lines are signals transmitting information. 

• If two symbols TJ are too close in the initial configuration, they are destroyed by the 
bootstrapping process (see Section 13.2. lj) . If a TJ is in a group of TJ separated by two 
cells or less, the rightmost TJ sends a sweeping counter and the leftmost one starts a time 
counter. Thus a group of TJ separated by two cells or less behave as a single symbol for 
initialization purposes. All the TJ except the leftmost one are transformed immediately into 
oracle signals (supposing the basis of the counter is larger than 3 then they occupy only one 
cell) and the other cells present initially are erased. 

• The Turing machine simulation described in Section 13.3.11 can be adapted to simulate a 
Turing machine with oracle. When there is an oracle query for the value of £ l ([TJ]) with the 
precision 2~ l at time t E [T n ,T n +i], there are two possibilities: 

— if n~e < 2~ l , the Turing machine uses the information stored in the oracle layer to 
return the frequency of TJ on the segment at time 0, and this corresponds to an 
approximation of /u([TJ]) with sufficient precision; 

— if n~(i > 2~ l , the computation stops, and the previously computed word will be copied 
at time T n+ \. The same thing happens until time Tj, when enough information will be 
available. 

Let us check that V(7r/z) = V(F,[i) for G A4^ mix (A z ). It is clear that the density of auxiliary 
states tends to 0, so if the sample approximates correctly ^([TJD, the sequence of words (w n )neN 
produced by the cellular automaton correspond to (/(/x, n)) ne ^ U P to some repetition. Thus we 
only need to prove that the probability that a cell belongs to a segment which sample correspond 
to a "bad" approximation tends to when t tends to 00. Recall that r^-i = {x £ \ is a 
segment at time T„,}. 
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B„ = fi (J^x E A^ : xq belongs in a segment with a "bad" sample at time ^nl^ 

j<o,i>o 



fc>0 

by cr-invariance. By restricting ourselves to n < k < K n : 

Kn 



k=n 



— »• 0. 



n— >oo 

The result follows. 



□ 



This result may seem surprising since the same cellular automaton has very different asymptotical 
behaviors depending on the initial measure. 

Open question 4. Is it possible to improve Theorem^ and characterize functions f : A1^ mix ({0, 1} Z 
where 8. is a set of compact subsets of M. CF {B 1 '), that are realisable as the action of a cellular 
automaton F in the sense that for all fx, V(F,fi) = f{n)? 

5. Removing the auxiliary states 

In this section, our aim is to carry the previous results to the case where the cellular automaton 
does not use auxiliary states. A straightforward extension is impossible: for example, consider v a 
recursively computable measure with full support and F : A z — > A z a cellular automaton such that 
F*(x —> v when [i is a "simple" measure. Since v has full support, F is a surjective automaton, and 
hence the uniform Bernoulli measure is invariant under F*. Thus v must be the uniform Bernoulli 
measure. 

However, if the limit measure does not have full support, the previous results can be extended 
by using a word not charged by the measure to encode the auxiliary states in some sense. 

Theorem 3. Let (u> n ) n eN be a computable sequence of words of B* , where B is a finite alphabet, 
and let u E B* a word that does not appear as factor in any of the w n . Then there is a cellular 
automaton F : i3 z — )• B 1 ' such that for any measure fj, E •M^ m ^( J B Z ) , V(F,fi) = V((w n ) n ^). 

Proof. Let A be the alphabet and F be the CA associated to the sequence (w; n )neN by Theorem [TJ 
Our aim is to provide an encoding of any configuration of A z in B 1, and a cellular automaton F' 
that behaves similarly to F after encoding. 

Denote Ud C B d be the set of words of length d beginning with u, that do not contain u as factor 

(except at the first letter), and that do not end with a prefix of u. #(Ud) — > oo, so for d large 

d— >co 

enough, we can find an injection <p : A\B — > Ud, and we extend it by putting <p = Id on B. For a 
finite word, we define (p(u\ . . . u n ) = ip(u\) . . . (p(u n ), and this can be naturally extended further to 
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configurations $ : A i— > B by considering that 9?(ao) starts on the column zero. 

Let T C A 1 ' be the set of configurations such that the word u does not appear on the output layer 
(T is a subshift of finite type). Since u marks unambiguously the beginning of a word of (p(A\B), 
the restriction $ : T — > £> z is injective. 

Each configuration from £> z can thus be divided uniquely into words from ip(A), that we will 
call clusters from now on. Output cells are elements of B = f(B) that occupy only 1 cell 
(corresponding to (b, for b € B in the previous construction) and auxiliary clusters 

are elements of (p(A\B) that occupy d cells while containing one letter of output. Thus we can 
define a decoding ^ : B z — > T such that ^ o = Id. 

However, $ and \]/ are not cr-invariant, so <3? o F o vj/ i s not a cellular automaton. We must build 
manually a cellular automaton on /3 Z that behaves in roughly the same way as <J>o i^o \1'. Provided 
the neighborhood is larger than [— 4<i, 4d], each cell can "read" the cluster in which it belongs, and 
the three clusters to its right and left. 

If a word u appears outside of an auxiliary cluster, it is replaced by some output cells and can 
never be created again. To avoid creating an auxiliary cluster by mistake, we fix to this purpose a 
letter b G B such that b d ^ Ud- Similarly, auxiliary clusters that are destroyed for any reason leave 
behind them output b cells. 

The different parts of the construction are modified in the following way. 

• |TJ and |w| clusters, time counters, and Turing machines have the same behavior as in the 
previous construction. However, since the counters take more space, it is necessary to erase 
3d cells to the left and right of each [TJ cluster at time 0. 

• The copying process progresses to the left at speed one, and behaves normally as long as it 
does not meet another auxiliary state (see Figure [T5|) . 
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FIGURE 15. End of the copying process described in Figure [?1 copying 1101. 

• Sweeping counters progress to the right at speed d. This is too fast to keep the output 
information, so the counter leaves behind output cells b defined above. Any moving signal it 
meets (e.g. copying process or length signal) is destroyed. When entering the time counter, 
if it cannot progress by d cells exactly, it is offset by less than d cells (see Figure [To]) . Thus 
sweeping clusters separated by small offsets are still considered to be the same counter. 
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FIGURE 16. A sweeping counter gets offset when entering the computation / time 
counter area. Notice the output cells are replaced by zeroes. 

• Merging signals which determine length of segments also progress at speed d. To avoid being 
offset by copying processes (which would modify the "measured length"), the determination 
of length starts only after the copy is finished. Thus the signal is only offset once, when 
entering the time counter area. After bouncing off the right wall, it returns to the left wall 
where its offset can be measured. If it takes to steps to return with an offset of a, then 
the segment has length Q ■ d + a (see Figure [IT]) , On the left side of the wall, a Turing 
machine computes the measured length and compares it with n, and a [m] symbol is created 
if needed. 
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FIGURE 17. Determination of length. Here d = 3, to = 8 and o = 1, for a measured 
length of 13. 
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Remark 3. For clarity, in all diagrams of this section, we suppose that B = {0, 1}, d = 3 (it would 
be much larger in real implementations) and we represent auxiliary clusters as blocks with layers, 
instead of words from B d . Also we fix b = in the definition above. 

The bootstraping and sweeping processes work essentially in the same way as previously, except 
that a sweeping counter erases any copy process and merging signal it meets, along with output 
information. Hence Propositions [8] and [9] can be extended. Furthermore, at time t, with T n < t < 
T n+ i, the copy process followed by the process of determination of length for segments of size n + 1 
still take less than T n+ \ — T n steps. Hence the proof in section 13.5.31 can be extended, and the 
theorem follows. □ 

However, because of the destructive nature of the sweeping counter, the proof in section 13.5.41 
cannot be adapted and we cannot weaken the hypothesis to fi G A4^ erg (B' 1 ') when V is a singleton. 
Since this result is a counterpart to the second point of Theorem [1] that does not use auxiliary 
states, it is natural to give similar counterparts to corollaries [2] to [71 

Definition 12. A word u G A* is said to be not charged by a set V G M. a (A z ) if for all v G V, 

KM) = o. 

Corollary 8. Let V C A4 a (B z ) be a non-empty Y<2-CCC subset of A4 a (B z ) that does not charge 
a word u £ B* . Then there is a cellular automaton F : £> z — > £> z such that for any measure 
[i G M f ^ mix (B z ), V(F,fi) = V. In particular, any recursively computable measure which does not 
have full support can be obtained this way. 

Proof. Since V does not charge u, we can assume without loss of generality that no word in the 
computable sequence (w n )neN associated to V by PropositionQ]contains u as factor. Thus Theorem[3] 
applies. □ 

The proofs of the following corollaries are adaptations of the proofs of their counterparts using 
Theorem [3j Of course Corollary Q] does not have a counterpart since its proof uses the first point of 
Theorem [TJ 

Corollary 9. Let V' C V C A4(B Z ) two non-empty Yi^-CCC sets that both do not charge the same 
word u G B* . Then there exists a cellular automaton F : B — > B such that for any fi G ^(A z ), 

. V(F,fi) = V; 
• V{F,fi) = V. 

Corollary 10 (Rice theorem on sets of limit measures). Let B be a fixed alphabet, [i G M^^^JJ^*), 
u G B* , and P be a property on set of measures that is nontrivial (i.e. not always or never true) on 
non-empty T,2-CCC sets that do not charge u. Then it is undecidable, given a CA F : B z — > £> z , 
whether V(F, /x) satisfies P. 

This result extends to single measures and limit points of Cesaro mean sequence, in a similar way 
as Corollaries [6] and [7J 

We leave open in particular the case of limit measures with full support. For corollaries [8] and 
[9j solving this case would imply to characterize the possible asymptotic behaviors of surjective 
automata, for which a similar construction seems difficult. As for Corollary [TUl if we fix [i the 
uniform Bernoulli measure, the problem of whether V(F, fx) contains only the uniform Bernoulli 
measure is equivalent to the surjectivity of F, which is decidable [A P72] . Hence the question 
of which nontrivial properties on limit measures and sets of limit measures with full support are 
decidable remains open. 

Open question 5. Which sets of measures are reachable by surjective cellular automata? 
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